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Abstract 

The structural origin of the size of the 11 nm nucleosomal disc is addressed. On the nanometer 
length-scale the organization of DNA as chromatin in the chromosomes involves a coiling of DNA 
around the histone core of the nucleosome. We suggest that the size of the nucleosome core particle is 
dictated by the fulfillment of two criteria: One is optimizing the volume fraction of the DNA double 
helix; this requirement for close-packing has its root in optimizing atomic and molecular interactions. 
The other criterion being that of having a zero strain-twist coupling; being a zero-twist structure is a 
necessity when allowing for transient tensile stresses during the reorganization of DNA, e.g., during 
the reposition, or sliding, of a nucleosome along the DNA double helix. The mathematical model we 
apply is based on a tubular description of double helices assuming hard walls. When the base-pairs of 
the linker-DNA is included the estimate of the size of an ideal nucleosome is in close agreement with 
the experimental numbers. Interestingly, the size of the nucleosome is shown to be a consequence of 
intrinsic properties of the DNA double helix. 



Introduction 

The packing of DNA and histones into nucleosomes is the first level of compaction of the deoxyribonu- 
cleic acid of the chromosomes. The nucleosomes form the fundamental repeating building structure of 
chromatin, each incorporating a stretch of about two- hundred consecutive base pairs pQ. A nucleosome 
consist of the core nucleosome particle with 146/147 bp and a linker-DNA of variable length, typically 
with about 50 bp [2J. Crystallographic studies have revealed the detailed geometry of the nucleosome 
core particle [3HH[5]. An octameric disc is formed of the alkaline proteins H2A, H2B, H3 and H4; on this 
disc the double stranded helical DNA is coiled ~ 1.7 times in a nearly flat left-handed super helix. Cells 
in the human body contain 2 x 23 chromosomes with perhaps as many as 4 x 10 7 nucleosomes in total, 
and most remarkable these four histones are nearly conserved across all eukaryotic cells. Even in cases 
with significant sequence differences they remain structurally exceptionally similar [5J. A typical yeast 
genome contains about 6 x 10 4 nucleosomes [7J. 

Structurally, the nucleosomes support an efficient compaction of DNA [8] . Electrostatic interactions 
between DNA and the histones contribute to the stability of the nucleosome [HI HO] > as well as of any 
formed superstructures such as the 30 nm filament [TTJ [TH [T3] • During the last decade, or two, it has 
become clear that the functional roles of chromatin/nucleosomes/histones are much richer than being 
limited to that of compaction [TH [T3 US] . Post-translational biochemical processes [T7J EH HH1 [Ml EH] i 
e.g. methylation, ubiquitination, and acetylation, as well as nucleosome-positioning proteins |22| can have 
profound and decisive impact on regulating phenomena such as transcription, recombination, and repair. 
Biochemical processes may also control phenomena such the sliding of the nucleosomes along the DNA 
[2"31 |2"4] and of possible histone eviction [25] . It was early recognized that the DNA of the nucleosome has 
a modified number of base pairs per full rotation (25|[23[2E]- The average number is about 10.1 bp/27r 
for nucleosomal DNA in comparison with 10.5 bp/27r for B-DNA. 

The linking number of a set of two closed space curves is a conserved integer. White's theorem |2"5] 
equate the topological linking number with the sum of two geometrical measures - the writhing and the 
total twist. For a review of the relevance of the linking number for biomolecules and of knot theory see 



ref. [SU] • In practice the linking is also conserved for a local stretch of a curve when the full curve extends 
too far for it to globally reorganize. An example of such a local event is the eviction of the histones 
associated with the dissolution of one nucleotide [25]. Models of the chromatin fiber have incorporated 
simplified interactions between nucleosome core particles |31) . Recently, the understanding of linking as 
well as of steric interactions have been further embraced in proposed detailed models for the chromatin 
fiber [32]. Furthermore, models of the chromatin fiber has incorporated detailed molecular force fields 
[33]. 

In this paper we address the size of the nucleosome with its structurally highly conserved core. It is 
interesting to inquire into why the nucleosome core particles have the size they have, and to know the 
answers to related questions. For example, would it at all be possible for the nucleosomes to be twice 
the size? We propose that the DNA of the nucleosomes must obey certain criteria and show that the 
combination of two criteria leads to a unique requirement for the size of an ideal nucleosome. This can 
explain why the size of the nucleosomal disc is highly conserved throughout the eukaryotes. 

Model 

In the following we define two criteria, one for the twisting behavior and one for the packing properties 
and suggest that nucleosomal DNA obeys both of the criteria. We then model an ideal nucleosome as 
one that obeys the two criteria simultaneously. To find the size of an ideal nucleosome it is necessary 
to understand how the two criteria changes with the superhelical curvature of the DNA double helix, 
i.e. with bending of the double helix. To demonstrate this behavior we calculate the results for tubular 
double helices. This simplification has worked well in the past for generic a-hcliccs, and for A- and 
B-DNA where accurate theoretical estimates of the pitch angles of experimental structures are obtained 

ED- 

Here the criteria are discussed in connection with their application with long polymers: 

The Zero-Twist Criterion. When long structures are under longitudinal strain they tend to twist. 
Certain structures have a vanishing strain-twist coupling, we denote such structures as zero-twist struc- 
tures. Fascinatingly, even chiral structures can be zero-twist structures 135\/ . The zero-twist criterion 
simply requires structures to be zero-twist structures, as this prevents a rotational catastrophe for long 
structures. Recently, we have shown that N -helices with a unique pitch are zero-twist structures \35^ and 
have suggested that it is advantageous for the triple helix of collagen to obey the zero-twist criterion \36f . 

Close-Packing Criterion. Atomic and molecular structures tend to use space efficiently, e.g. a good 
fraction of the elements of the periodic system form close-packed structures, or nearly close-packed struc- 
tures. Recently, we have shown that not all helices use space equally efficient J^[ /; the helices that optimize 
the volume fraction are denoted close-packed helices. The a-helices, A-DNA, and B-DNA have all been 
shown to obey the close-packed criterion In short, it is difficult to increase the effective volume of an 
atomic or a molecular structure even by a relatively tiny fraction. This is the origin of the close-packing 
criterion. 

Being zero-twist and having a conserved linking number both relate to the twisting of a polymer chain. 
However, the two features are distinct from each other. Having a conserved linking number says that 
the oriented space curve of the polymer backbone shall preserve its linking, unless enzymatic cleaving of 
the sugar-phosphate backbone of DNA is active. The difference between zero-twist and conserved linking 
number can be understood when one considers a transformation from molecular path-I to path-II. Such an 
actual transformation will typically involve a transient longitudinal straining of the molecular structure. 
This is where the zero-twist criterion comes into play: It prevents a transient catastrophic twisting of long 
polymers. An example which illustrates this is the sliding (or repositioning) of a nucleosome along the 
double stranded DNA polymer. Such a repositioning of the nucleosome trivially obeys having a conserved 
linking number. This is not so for the zero- twist criterion, generally a strained helical structure will twist, 



2 



i.e. long-ranged and accumulative modifications to the helix will result from an induced longitudinal 
strain. Certain unique structures have zero strain-twist coupling, these are the zero-twist structures that 
can be longitudinally strained without causing long-ranged propagated modifications in twist. 



Results 



First we consider the case of absence of supcrhelical curvature, i.e. a straight section of a double helix. 
Figure 1A shows the calculated volume fraction, fy, as function of the dimensionless value of the double 
helical pitch, H/D. Here, H is the pitch of the double helix and D the diameter of one of the two 
strands of the double helix. For DNA the strand diameter is D w 11 A |34j . The maximum value of the 
volume fraction, see Figure 1A, is obtained for H/D=2.399, this double helix obeys the close-packing 
criterion. Likewise, the twist, 0, of a straight section of a double helix can be calculated. It is more 
useful to calculate the twist normalized by the dimensionless length of the strands, DQ/L. Figure IB 
depicts DQ/L as a function of H/D. The twist is the incremental rotation of the double helix as one 
progresses along one of the two strands. The maximum appears at H / D— 2.697, this double helix obeys 
the zero-twist criterion. 
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Figure 1: (A) The volume fraction fv for a symmetric tubular double helix as a function of the dimen- 
sionless pitch, H/D, where H is the pitch (the length of a repetitive segment of the double helix), and 
D the diameter of one of the tubular strands. The maximum value of the volume fraction is 0.769 for 
H/D = 2.399. This is the close-packed double helix with a pitch angle of 32.5° [SJ^l. (B) The rate of 
change, DQ/L, in the twist of a tubular double helix per unit increment of the dimensionless length of 
the strands, L/D, depicted as a function of the dimensionless pitch, H/D, of the helix. The maximum is 
at H/D = 2.697. 



Therefore it is not possible for a straight double helix to simultaneously obey the zero-twist criterion 
(response to strain) and the close-packing criterion (molecular forces). A compaction of DNA that 
involves long straight sections of DNA would therefore exhibit a rotational catastrophe under tensile 
stress. However, the nucleosomes bend the DNA and thus introduce a curvature to the double helix as 
the double helix is coiled around the histone proteins. As curvature is introduced, the steric interactions 
between the two individual strands of the double helix changes. This means that the curves depicted 
in Figure 1A and Figure IB needs to be recalculated for each value of the curvature of the center line 
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of the double-stranded DNA molecule. Hence, the pitch of the helix for which the zero-twist criterion 
is fulfilled changes with the amount of introduced curvature, correspondingly so for the close-packing 
criterion. Figure 2 shows the pitch, H/D, of the zero-twist and close-packed double helices as a function 
of D/R. The radius of curvature, R, is the radius of the central line of the double helix of nucleosomal 
DNA. A straight segment of DNA has R = oo. For both the zero- twist helix and the close-packed helix 
the helical pitch is increased when bending is enforced, though more rapidly so for the close-packed helix. 
The result is that for a curvature of D/ R=0.391 the two criteria can be simultaneously obeyed. We 
suggest that this is the defining science behind the size of the nucleosome. We therefore estimate the size 
of such an ideal nucleosome to be 2R = 56 A. 
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Figure 2: The blue curve shows the value of the helical pitch that obeys the zero-twist criterion as a 
function of the dimensionless curvature, D/R, and the red curve shows the value of the helical pitch for 
the curved helices that obey the close-packed criterion as a function of curvature, D/R. As can be seen 
the two curves meets at a value of D/R — 0.391, with D rj 11 A this corresponds to 2R = 56 A. The 
torus double helix shown has D/R = 0.391 and is therefore both close-packed and zero-twist. 

The above results only holds for symmetric double helices. B-DNA is asymmetric with 144° between 
the strands towards the minor groove, and 216° for the major groove |27) . The presented calculation 
for symmetric double helices therefore needs to be modified with steric restrictions for the asymmetric 
double helix. Figure 3A shows the volume fraction, fy, for an asymmetric helix and Figure 3B shows the 
rate of increment in twist, DO/L, for an asymmetric double helix in the absence of curvature. Figure 
4 shows the dimensionless pitch, H/D, of the asymmetric B-DNA like double helix as a function of 
introduced curvature, D/R. The results is now that for a curvature of D/R = 0.141 the two criteria can 
be simultaneously obeyed. This leads to an estimate for 2R = 156 A. 

We now take into account that the nucleosome core particle comes with an additional stretch of B- 
DNA, often denoted the linker-DNA. If one makes the requirement that the nucleosome core particle 
together with the linker-DNA behaves as a zero-twist structure then the nucleosome core particle needs 
to properly compensate for the behavior of the linker-DNA. I.e. the nucleosome core particle needs to 
have a further reduced radius. B-DNA has a incremental twist of D 2 d^/dH = 0.135 at the close-packed 
structure. If we assume that the linker-DNA is about 50 bp then the DNA of the nucleosome core particle 
needs to have an incremental twist of D 2 d®/dH — —(50/147)0.135 = —0.046. This requirement leads 
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Figure 3: (A) The volume fraction fy for an asymmetric (minor groove 144°^) tubular double helix as 
a function of the dimensionless pitch, H/D, where H is the pitch (the length of a repetitive segment of 
the double helix), and D the diameter of one of the tubular strands. The maximum value of the volume 
fraction appears for H/D = 3.232. This is the close-packed double helix with a pitch angle of 38.1° . In 
ref. a value 142° was used for the minor groove and therefore the obtained pitch angle was 38.5° 

instead o/38.1°. (B) The rate of change, DQ/L, in the twist of an asymmetric (minor groove 144°) 
tubular double helix per unit increment of the dimensionless length of the strands, L/D, depicted as a 
function of the dimensionless pitch, H/D, of the helix. The maximum value is at H/D = 3.452. 



to D/R — 0.221 and a diameter of 2R = 100 A. For this particular curvature of DNA the twisting, 
and counter-twisting, of the linker-DNA and of the nucleosomal core particle DNA precisely cancel. 
Table 1 summarizes the scale invariant parameters for the three presented nucleosomal models. Three 
parameters fully describe a structure, the fourth parameter is given for convenience. Models of chromatin 
often incorporate a relatively large range for the length of the linker-DNA (30-90 bp) [32l [33] ■ If we use 
the less common value 70 bp we find 2R — 83 A and a disc diameter of 11 nm. 



Table 1: Calculated values for the scale invariant variables describing the bent symmetric double helices 
(A-DNA ) which has both close-packed ( CP) and zero-twist (ZT) properties. Same for the bent asymmetric 
double helix (B-DNA), here the phase of the minor groove is set at 144° [271. Lastly, is listed the values 
for the curved asymmetric double helices (B-DNA) which compensates the twisting such that it precisely 
cancels a 50 bp stretch of linker-DNA. 



Model 


Description 


D/R 


H/D 


2a/D 




Model 1 
Model 2 
Model 3 


CP & ZT curved A-DNA 
CP & ZT curved B-DNA 
CP k compensating B-DNA 


0.391 
0.141 
0.221 


3.212 
3.696 
3.976 


1.029 
1.237 
1.203 


4.569 
5.364 
5.493 
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Figure 4: TTie curve shows the value of the helical pitch of helices that obey the zero-twist criterion 
as a function of the dimensionless curvature, D/R, and the red curve shows the value of the helical pitch 
for the curved helices that obey the close-packing criterion as a function of curvature, D/R. As can be 
seen the two curves meets at a value of D / R = 0.141, with D = 11 A this corresponds to 2R — 156 A. 
The asymmetric double helix also has D/R — 0.141 and is therefore both close-packed and zero-twist. 



Discussion 

Measuring the distance from the centerline of the DNA helix of the nucleosome core particle PDB entry 
3LZ0 (37] we find 2R PDB ~ 83 A. Table 2 compares the values of 2R from crystallographic studies 
with the three models discussed above and enlisted in Table 1. The value for the compensating model 
2R = 100 A is in good agreement with the experimental value of 83-84 A. The outer diameter of the 
nucleosomal disc for this model is 2R + 2a + D = 12 nm, in fair agreement with the well established size 
of the nucleosome disc, i.e. 11 nm. 

When one bends a double helix, as is the case when wrapping it around the histone proteins, curvature 
is introduced and the steric interactions are modified compared to those in a straight double helical 
segment. This modifies the number of base pairs per full rotation. Such a change in the number of base 
pairs per rotation was first observed in theoretical studies using empirical energy functions [26] ■ Figure 



Table 2: Comparison of experimental and calculated values for 2R for nucleosomal DNA and for the 
diameter of the nucleosome disc 2R + 2a + D. Model 1 is for a symmetric double helix (A-DNA). Model 
2 is for an asymmetric double helix (B-DNA). Model 3 is for an asymmetric helix including the effect 
of a 50 bp long linker-DNA. The numbers for 2R are given in units of A while the numbers for the disc 
size, 2R + 2a + D are in units of nm. 







3ZL0 


1ZLA 


Model 1 


Model 2 


Model 3 


Nucleosomal DNA 


2R (A) 


83 


84 


56 


156 


100 


Size of the nucleosome 2R + 2a - 


- D (nm) 


11 


11 


8 


18 


12 
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5 depicts the number of base pairs per full 2ir rotations for the asymmetric tube model as a function 
of dimensionless curvature D/R. As can be seen for the model 3, D/R = 0.221, we predict 10.0 bp/27r 
slightly below the experimentally known figure of 10.1 bp/27r. The discussion shows the steric interactions 
within the double helix causes DNA to twist further when it is bent, i.e. to have fewer base pairs per 
full rotation. This is why the superhelix of the nucleosomal DNA is left-handed, as this handedness 
helps to compensate for the change in linking. In other words, had the nucleosomal DNA superhelix 
been right-handed then there would have been a serious linking number discrepancy associated with the 
evictions of the histones. 

11.0 
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Figure 5: The estimated number of base pairs per full 2tt rotation for the curved asymmetrical tubular 
double helix. Normalized such that there is 10.5 bp/2ir for zero curvature. I.e. for B-DNA Li^ / 1 D = 5.238 
with 10.5 bp/2n. 

By combining two criteria - to be close-packed and to be a zero-twist structure - we have come to the 
suggestion that the DNA of the nucleosome must have a specific curvature. Therefore, neither with a core 
of histones twice the size, nor with a core half the size, would the nucleosomal construction have worked. 
This is consistent with the octameric disc of the H2A, H2B, H3 and H4 histones being structurally highly 
conserved. Furthermore, the use of a tubular model reaches a number for the size which agrees with 
the scale of known size of the nucleosomes from the PDB data base, and predict the approximately 
observed change in number of base pairs per full rotation. Considering the geometrical idealization of 
using a tubular model the agreement is remarkable. It is interesting to note that this means that the two 
features that determine the size of the nucleosome when working in congruity both are intrinsic features 
of the double helix of DNA. Hence the presented results could not have been obtained by modeling the 
double helical DNA as a single tube. There is currently much progress in experimental studies of tension 
and torsion of chromatin, for a review see [41] . and presumably studies on single nucleosome core particles 
will soon be performed. 

Methods: 

We utilize a tubular model and assume in all calculations that the helix is formed of two flexible tubular 
strands with hard walls which are in contact with each other. These contacts define the steric hinderance, 
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and their solutions are found numerically which involves solving transcendental equations. For a detailed 
description of how we solve the condition of inter-strand contact, see ref. |34) . wherein are also described 
how the asymmetry of B-DNA is handled in tubular models. For references to earlier work on how to 
solve for inter-strand contacts see ref. [351 [Ml SD] ■ 

The twist, of straight section of a double helix formed from two tubular strands each being L long 
can be calculated as 



Here H is the pitch of the double helix and a the radius of the imaginary cylinder which hosts the 
parametric representation of the strands. In the following equations D is the diameter of one of the two 
flexible tubes. 

We define a zero-twist structure as one that obeys, 



Therefore, a zero-twist structure is one that neither rotates clockwise nor anti-clockwise when under 
tensile strain. 

The volume fraction of a double helix, fv(H/D), is easily calculated |34) . It describes the effectiveness 
of the use of space calculated relatively to an enclosing cylinder, 



where Vs is the volume of a strand being L long, Ve is the volume of the smallest cylinder which 
circumscribes the double helix section. For a bent double helix, the reference volume becomes a section 
of a torus instead of a cylinder. 

The criterion for being close-packed is that the volume fraction, fy, is at its maximum. The maximum 
point can be found by considering, 



D±tfr) = (4) 



Expressed in terms of the parameters H, a, and D the volume fraction becomes 



2a \ f ,2ira 



1/2 



For the case of a curved section of a double tubular helix the continuous symmetry of the inter-strand 
contacts point is broken. This means that the inter-strand contact points no longer form helical lines but 
now appears in discrete sets of points. For the calculation we have consistently used the most restrictive 
condition of inter-strand contacts. This condition appears when the minor groove coincides with the 
inside of the equatorial plane of the bent double helix. 

Consistent with this, the twist, 9, of a bent double helix is 




s 



where R is the radius of the centerline of the bent double helix. Similarly, the volume fraction of the 
bent double helix is 

*-<i + ^;r( ( £ ),+(1 -5~ ,), f * (7) 
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